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ID301248 

'VIDEO CODING METHOD AND CORRESPONDING DECODING METHOD" 

FIELD OF THE INVENTION 

The present Invention generally relates to the field of video compression and 
decompression and, more particularly, to a three-dimensional (3D) video coding method for the 
compression of a bitstream corresponding to an original video sequence that has been divided 
into successive groups of frames (GOFs) the size of which is N = 2 n with n = 0, 1, 2,..., said 
coding method comprising the following steps, applied to each successive GOF of said original 
sequence : 

a) a spatio-temporal analysis step, leading to a spatio-temporal multiresolution 
decomposition of the current GOF into 2 n low and high frequency temporal subbands, said step 
itself comprising : 

- a motion estimation sub-step ; 

- based on said motion estimation, a motion compensated temporal filtering sub- 
step, performed on each of the 2"" 1 couples of frames of the current GOF ; 

- a spatial analysis sub-step, performed on the subbands resulting from said 
temporal filtering sub-step ; 

b) an encoding step, said step itself comprising : 

- an entropy coding sub-step, performed on said low and high frequency temporal 
subbands resulting from the spatio-temporal analysis step and on motion vectors obtained by 
means of said motion estimation step ; 

- an arithmetic coding sub-step, applied to the coded sequence thus obtained and 
delivering an embedded coded bitstream. 

The invention also relates to a transmittable video signal generated by said coding 
method, to a method for decoding such a signal, and to coding and decoding devices for the 
implementation of said coding and decoding methods, 

BACKGROUND OF THE INVENTION 

From MPEG-1 to H.26L, standard video compression schemes were based on so- 
called hybrid solutions : an hybrid video encoder uses a predictive scheme where each frame of 
the Input video sequence is temporally predicted from a given reference frame, and the 
prediction error thus obtained by difference between said frame and its prediction is spatially 
transformed (the transform is for instance a bi-dimensional DCT transform) in order to get 
advantage of spatial redundancies. 

A different approach, later proposed, consists in processing a group of frames 
(GOF) as a three-dimensional (3D, or 2D + 1) structure and spatio-temporally filtering it in order 
to compact the energy in the low frequencies (as described for Instance in 'Three-dimensional 
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subband coding of video", CI. Podilchuk and al., IEEE Transactions on Image Processing, vol.4, 
n°2, February 1995, pp.125-139). The introduction of a motion compensation step in such a 3D 
subband decomposition scheme allows to improve the overall coding efficiency and leads to a 
spatio-temporal multiresolution (hierarchical) representation of the video signal thanks to a 

subband tree, as depicted in Fig.l. 

The illustrated 3D wavelet decomposition with motion compensation is similarly 
applied to successive groups of frames (GOFs). As shown in Fig.l, each GOF of the input video, 
including in the illustrated case eight frames Fl to F8, is first motion-compensated (NIC) in order 
to process sequences with large motion, and then temporally filtered (TF) using Haar wavelets 
(the dotted arrows correspond to a high-pass temporal filtering, while the other ones 
correspond to a low-pass temporal filtering). Three stages of decomposition are shown (L and H 
= fi^ stage ; LL and LH = second stage ; LLL and LLH = third stage). 

The high frequency temporal subbands of each level (H, LH and LLH in the above 
example) and the low frequency temporal subband(s) of the deepest one (LLL) are then 
spatially analyzed through a wavelet filter, and an entropy encoder allows to encode the 
wavelet coefficients resulting from this spatio-temporal decomposition. For example, the 2D- 
SPIHT, originally proposed by A. Said and W.A. Pearlman in "A new, fast, and efficient image 
codec 'based on set partitioning in hierarchical trees", IEEE Transactions on Circuits and Systems 
for Video Technology, vol.6, n°3, June 1996, pp.243-250, has been extended to the 3D wavelet 
decomposition, in order to efficiently encode the final coefficient bitplanes with respect to the 
spatio-temporal decomposition structure. 

However, all the 3D subband solutions suffer from the following drawback : since 
an entire GOF is processed at once, all the pictures in the current GOF have to be stored before 
being spatlo-temporally analyzed and encoded. The problem is the same at the decoder side, 
where all the frames of a given GOF are decoded together. In order to solve said problem, a 
european patent application has been filed by the applicant on June 28, 2002, with the 
registration number 02291621.7 (PHFR020065) . In said document, a low-memory solution has 
been proposed, in which a progressive reconstruction of the frames of a GOF of the sequence is 
performed, instead of a reconstruction of the whole GOF. 

Said low-memory solution was based on the following remarks. As illustrated in 
Fig.2 (In the case of a GOF of eight frames for the sake of simplicity of the figure), said frames 
Fl to F8 are grouped into four couples of frames CO to C3. At the end of the first step of the 
temporal decomposition of the original sequence, low frequency temporal subbands LO, LI, L2, 
L3 and high frequency temporal subbands HO, HI, H2, H3 are available. While the subbands HO 
to H3 are coded and transmitted, the subbands LO to L3 are further decomposed : at the end of 
this second step of the decomposition, tow frequency temporal subbands LLO, LLL and high 
frequency temporal subbands LHO, LH1 are available. Similariy, while the subbands LHO, LH1 
are coded and transmitted, the subbands LLO, LL1 are further decomposed and, at the end of 
the third step of decomposition (the last one in the illustrated case), a low frequency temporal 
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subband LLLO and a high frequency temporal subband LLHO are available and will be coded and 
transmitted. The whole set of transmitted subbands is surrounded by a black line in Rg.2. 

It has then been observed that only the subbands HO, LHO, LLLO and LLHO are 
needed to decode the first two frames Fl, F2 (i.e. the couple CO) of the GOF. Furthermore, the 
first subband HO contains some information only on these two first frames F1,F2. So, once these 
frames Fl, F2 are decoded, the first subband HO becomes useless and can be deleted and 
replaced : the next subband HI can be loaded in order to decode the next couple CI including 
the two frames F3, F4. Only the subbands HI, LHO, LLLO and LLHO are now needed to decode 
these frames F3, F4 and, as previously for HO, the subband HI contains some information only 
on these two frames F3, F4. So, once these two frames F3, F4 are decoded, the second 
subband HI can be deleted, and replaced by H2. And so on : these operations are repeated for 
F5,F6 and F7,F8 (in the general case, for all the successive couples of frames of the GOF). The 
bitstream (the illustrated organization of which is only an example that does not limit the scope 
of the invention at the decoding side) thus formed for each successive GOF may be encoded by 
means of an entropy coder followed by an arithmetic coder. In the illustrated specific example, 
the coded bitstream finally available (and transmitted or stored) successively comprises, for the 
current GOF, a header and the coding bits corresponding to the subbands LLLO, LLHO, LHO, 
LH1, HO, HI, H2 and H3. 

The low-memory solution proposed in the cited european patent application was 
then the following. The part of the coded bitstream corresponding to the current GOF is 
decoded a first time, but only the coded part that, in said bitstream, corresponds to the first 
couple of frames CO (the two first frames Fl and F2) - Le.the subbands HO, LHO, LLLO, LLHO - 
is, in fact, stored and decoded. When the first two frames F1,F2 have been decoded, the first H 
subband, referenced HO, becomes useless and its memory space can be used for the next 
subband to be decoded. The coded bitstream is therefore read a second time, in order to 
decode the second H subband, referenced HI, and the next couple of frames CI (F3,F4). When 
this second decoding step has been performed, said subband HI becomes useless and the first 
LH subband too (referenced LHO). They are consequently deleted and replaced by the next H 
and LH subbands (respectively referenced H2 and LH1), that will be obtained thanks to a third 
decoding of the same input coded bitstream, and so on for each couple of frames of the current 
GOF. 

This multipass decoding solution, comprising an iteration per couple of frames in a 
GOF, is detailed with reference to Rgs 3 to 6. During the first iteration, the coded bitstream 
CODB received at the decoding side is decoded by an arithmetic decoder 31, but only the 
decoded parts corresponding to the first couple of frames CO are stored, i.e. the subbands LLLO, 
LLHO, LHO and HO (see Rg.3). With said subbands, the inverse operations, with respect to those 
illustrated in Fig.l, are then performed : 

- the decoded subbands LLLO and LLHO are used to synthesize the subband LLO ; 
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- said synthesized subband LLO and the decoded subband LHO are used to 

synthesize the subband LO ; 

- said synthesized subband LO and the decoded subband HO are used to 

reconstruct the two frames Fl, F2 of the couple of frames CO. 

When this first decoding step is achieved, a second one can begin. The coded 
bitstream is read a second time, and only the decoded parts corresponding to the second couple 
of frames CI are now stored : the subbands LLLO, LLHO, LHO and HI (see Fig.4). In fact, the 
dotted information of Fig.4 (LLLO, LLHO, LLO, LHO) can be reused from the first decod.ng step 
(this is especially true for the bitstream information after the arithmetic decoding, because 
buffering this compressed information is not really memory consuming). With these subbands, 
the following inverse operations are now performed : 

- the decoded subband LLLO and LLHO are used to synthesize the subband LLO ; 

- said synthesized subband LLO and the decoded subband LHO are used to 

synthesize the subband LI ; 

- said synthesized subband U and the decoded subband HI are used to 

reconstruct the two frames F3, F4 of the couple of frames CI. 

When this second decoding step is achieved, a third one can begin similarly. The 
coded bitstream is read a third time, and only the decoded parts corresponding to the third 
couple of frames C2 are stored : the subbands LLLO, LLHO, LH1 and H2 (see Fig.5). As 
previously, the dotted information of Fig.5 (LLLO, LLHO) can be reused from the first (or second) 
decoding step. The following inverse operations are performed : 

- the decoded subbands LLLO and LLHO are used to synthesize the subband LU ; 

- said synthesized subband LL1 and the decoded subband LH1 are used to 

synthesize the subband 12; 

- said synthesized subband L2 and the decoded subband H2 are used to 
reconstruct the two frames F5, F6 of the couple of frames C2. 

When this third decoding step is achieved, a fourth one can begin similarly. The 
coded bitstream is read a fourth time (the last one for a GOF of four couples of frames), only 
the decoded parts corresponding to the fourth couple of frames C3 being stored : the subbands 
LLLO, LLHO, LH1 and H3 (see Fig.6). Similarly, the dotted Information of Fig.6 (LLLO, LLHO, LL1, 
LH1) can be reused from the third decoding step. The following inverse operations are 
performed : 

- the decoded subbands LLLO and LLHO are used to synthesize the subband LL1 ; 

- said synthesized subband LL1 and the decoded subband LH1 are used to 

synthesize the subband L3 ; 

- said synthesized subband L3 and the decoded subband H3 are used to 

reconstruct the two frames F7, F8 of the couple of frames C3. 

This procedure is repeated for all the successive GOFs of the video sequence. 
When decoding the coded bitstream according to this procedure, at most two frames (for 
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example : Fl, F2) and four subbands (with the same example : HO, LHO, LLHO, LLLO) have to 
be stored at the same time, instead of a whole GOF. A drawback of that low-memory solution is 
however its complexity. The same input bitstream has to be decoded several times (as many 
times as the number of couples of frames in a GOF) in order to decode the whole GOF. 

SUMMARY OF THE INVENTION 

It is therefore a first object of the Invention to propose a coding method 
allowing to significantly reduce at the decoding side the memory space necessary to decode the 
3D subband encoded bitstream while avoiding the previous iterative solution. 

To this end, the invention relates to a video coding method such as defined in the 
introductory part of the description and which is further characterized in that, in the encoding 
step, the 2 n frequency subbands available at the end of the analysis step for each GOF are 
coded in an order that corresponds to a progressive reconstruction of the couples of frames of 
said GOF in their original order, the bits necessary to later decode the first couple of frames 
being at the beginning of the coded bitstream, followed by the extra bits necessary to decode 
the second couple of frames, and so on up to the last couple of frames of the current GOF. 

It is also an object of the invention to propose a transmittable video signal 
consisting of a coded bitstream generated by such a coding method, and a method for decoding 
said signal, using a reduced memory space with respect to the decoding method previously 
described. 

It is still an object of the invention to propose coding and decoding devices for the 
Implementation of said coding and decoding methods. 

BRIEF DESCRIPTION OF DRAWINGS 

The present invention will now be described, by way of example, with reference to 
the accompanying drawings in which : 

- Rg.l illustrates a 3D subband decomposition, performed in the present case on a 
group of eight frames ; 

- Fig.2 shows, among the subbands obtained by means of said decomposition, the 
subbands that are transmitted and the bitstream thus formed ; 

- Figs 3 to 6 Illustrate, in a decoding method already proposed by the applicant, the 
operations Iteratively performed for decoding the Input coded bitstream ; 

- Fig.7 illustrates the basic principle of a video coding method according to the 

invention ; 

- Figs 8 to 10 show respectively the three successive parts of a flowchart that 
illustrates an implementation of the video coding method according to the invention ; 

- Fig. 11 illustrate a decoding method according to the Invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

The principle of the invention is the following : the input bitstream is re-organized 
at the coding side in such a way that the bits necessary to decode the first two frames are at 
the beginning of the bitstream, followed by the extra bits necessary to decode the second 
couple of frames, followed by the extra bits necessary to decode the third couple of frames, etc. 
This solution according to the invention is illustrated in Fig.7, in the case of n=3 decomposition 
levels, but said solution is obviously applicable whatever the number n of these levels. At the 
output of the entropy coder 21, the available bits b are now organized in bitstreams BSO, BS1, 
BS2, BS3 that respectively correspond to : 

- the subbands LLLO, LLHO, LHO, HO useful to reconstruct at the decoding side the 

couple of frames CO ; 

- the extra subband HI, useful (in association with the subbands LLLO, LLHO, LHO 

already put In the bitstream) to reconstruct the couple of frames CI ; 

- the extra subbands LH1, H2 useful (in association with the subbands LLLO, LLHO 
already put In the bitstream) to reconstruct the couple of frames C2 ; 

- the extra subband H3, useful (in association with the subbands LLLO, LLHO, LH1 
already put in the bitstream) to reconstruct the couple of frames C3. 

As indicated, these elementary bitstreams BSO to BS3 are then concatenated In 
order to constitute the global bitstream BS which will be transmitted. In said bitstream BS, it 
does not mean that the part BS1 (for example) is sufficient to reconstruct the frames F3, F4 or 
even to decode the associated subband HI. It only means that with the part BSO of the 
bitstream, the minimum amount of Information needed to decode the first two frames Fl, F2 
(couple CO) is available, then that with said part BSO and the part BS1, the following couple of 
frames CI can be decoded, then that with said parts BSO and BS1 and the part BS2, the 
following couple of frames C2 can be decoded, and then that with said parts BSO, BS1, BS2 and 
the part BS3, the last couple of frames C3 can be decoded (and so on, in the general case of 2" 

couples of frames in a GOF). . 

With mis re-organized bitstream, the multiple-pass decoding scheme as previously 
proposed is no longer necessary. The coded bitstream has been organized in such a way that, 
at the decoding side, every new decoded bit is relevant for the reconstruction of the current 
frames. 

An implementation of the video coding method according to the invention is 
Illustrated in the flowchart of Figs 8 to 10. As illustrated in Flg.8 with the references 81 to 85, 
the current GOF (81) comprises N = 2 n frames AO, Al, A2,..., A(N-1) which are organized (82) in 
successive couples of frames CO = (AO, Al), CI = (A2, A3),..., C((N/2)-l) = (A(N-2), A(N-1)). At 
the first temporal level TL1, the temporal filtering step TF is first performed on each couple of 
frames (84), which leads to outputs TF(C0) = (L[1,0], H[1,0]), TF(C1) = (L[l,l], H[l,l]), ... , 
TF(C((N/2)-l)) = (L[l,((N/2)-l)], H[l, ((N/2)-2)]), in which L[.] and H[.] designate the low 
frequency and high frequency temporal subbands thus obtained. An updating step 85 then 




allows to store the logical indication of a connection between each couple of frames CO, CI, 
etc., and each subband that contains some information on the concerned couple of frames. 
These connections between a given couple of frames and a given subband is indicated by 
logical relations of the type : 

LCl,0]_IsUnkedWith_CO = TRUE 

H[l,0]_IsUnkedWith_CO = TRUE 

L[l,l]_IsUnkedWith_Cl = TRUE 

H[l,l]_IsLinkedWith_Cl = TRUE 

(said logical relations have been previously initialized in the step 83 : "for all temporal subbands 
S, for all couples C, SJsLinkedWith_C = FALSE"). 

As illustrated in Fig.9 with the references 91 to 98, the subband decomposition can 
then take place, between the operation 91 called jt = 1 (= beginning of the first temporal 
decomposition level) and the operation 95 called jt = jt+1 (= control of the following temporal 
decomposition level, according to the feedback connection Indicated in Fig.9 and activated only 
if, after a test 96, jt is lower than a predetermined value jtunax correlated to the number of 
frames within each GOF). At each temporal decomposition level, new couples K are formed (92) 
with the L subbands, according to the relations : 

KO = (LQt, 0], L [Jt, 1]) 

Kl = (L0t,2],LDt,3]) 



and a temporal filtering step TF is once more performed (93) on these new K couples : 
TF(K0) = (Lfo't+1, 0], H Dt +1, 0]) 
TF(K1) = (Lfjt+1, 1], H fjt+1, 1]) 



An updating step (94) is then provided for establishing a connection between each of the 
subbands thus obtained and the original couples of frames, i.e. for determining if a given 
subband will be involved or not at the decoding side In the reconstruction of a given couple 
of frames of the current GOF. At the end of the temporal decomposition, the following 
subbands : 

L(jt_max, n), for n = 0 to N/2 Jt , 

HO't, n), for jt = 1 to jt_max and n = 0 to N/^* 1 ), 
which correspond to the subbands to be transmitted, are extracted (97). This ensemble is called 
T in the following part of the description. A spatial decomposition of said subbands is then 
performed (98), and the resulting subbands are finally encoded according to the flowchart of 
Fig.10, in such a way that the output coded bltstream BS (such as shown in Rg.7) is finally 
obtained. 

After an entropy coding step 110, a control (111) of the bit budget level is 
performed at the output of the encoder. If the bit budget is not reached, the current output bit 
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b is considered (112), n is initialized (113), and a test 115 is performed on a considered 
subband S (114) from the ensemble T. If b contains some information about S (115) and if S is 
linked with the couple Cn (116), the concerned bit b is appended (117) to the bitstream BSn (n 
= 0, 1, 2, 3 in the ©(ample previously given with reference to Figs 1 to 7) and the following 
output bit b is considered (l.e. a repetition of the steps 111 to 117 is carried out). If b does not 
contain any information about S, or if S is not linked with the couple Cn, the next subband S is 
considered (118). If all subbands in T have not been considered (119), the operations (115 to 
118) are further performed. If all said subbands have been parsed, the value of n is increased 
by one (120), and the operations (114 to 120) are further performed for the next original couple 
of frames (and so on, up to the last value of n). At the output of the coding step 110, if the bit 
budget has been reached, no more output b is considered. 

Finally, when all output bits have been considered or if the bit budget has been 
reached (111), the whole coding step is considered as achieved and the individual bitstream 
BSn obtained are concatenated (130) into the final bitstream BS (from n=0 to its maximum 
value). At the decoding side, the decoding step is performed as now explained with reference to 
Fig.ll, where "state 0" (1, 2,...,n) means that the functioning of the entropy encoder is 
constrained by the reconstruction of a unique couple, CO in the present case (CO, CI, C2,....,Cn 
in the general case) with n = 0 to 3 in the illustrated example. In practice, when a bit b of the 
coded bitstream is received and decoded, it is interpreted as containing some pixel significance 
(or set significance) information related to a pixel In a given spatio-temporal subband (or to 
several pixels in a set of such subbands). If none of these subbands contributes to the 
reconstruction of the current couple of frames Cn (CO in the illustrated example), the bit b has 
to be re-interpreted, the entropy decoder jumping to its next state until b is interpreted as 
contributing to the reconstruction of Cn (CO in the present case). And so on for the next bit, 
until the current sub-bitstream is completely decoded. 

The described functioning of the decoding of the first couple CO (state "0") is 
therefore fairly straightforward with the above explanations, and Fig.l shows dearly the 3D 
subband spatio-temporal synthesis of the couple of frames CO : at the third decomposition level 
jt=3, the subbands LLL0 and LLH0 are combined (dotted lines), with motion compensation 
(arrow), to synthesize the appropriate subband LL0 of the second decomposition level jt=2, said 
subband LL0 and the subband LH0 are in turn combined, with motion compensation, to 
synthesize the appropriate subband L0 of the first decomposition level jt=l, and said subband 
L0 and the subband HO are in turn combined, with motion compensation, to synthesize the 
concerned couple of frames CO (jt=0). More generally, if the size of the complete GOF is N = 2", 
(n+1) temporal subbands (one low frequency temporal subbands and n high frequency 
temporal subbands) have to be decoded and (n-1) low frequency temporal subbands have to be 
reconstructed, which corresponds to a noticeable reduction of memory space with respect to the 
case of the decoding and reconduction of the entire GOF at once. In the illustrated case, at 
each step, the reconstructed low frequency subband of the lower temporal level (e.g. LU), at 
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jt=2) Is written over the previous one (e.g. LLLO, at jt=3), that gets lost. Thus there are never 
more than (n+1) temporal subbands stored in memory. 
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CLAIMS: 

1- A three-dimensional (3D) video codfng method for the compression of a bitstream 

corresponding to an original video sequence that has been divided Into successive groups of 
frames (GOFs) the size of which is N = 2 n with n = 0, 1, 2,..., said coding method comprising 
the following steps, applied to each successive GOF of the sequence : 

a) a spatio-temporal analysis step, leading to a spatio-temporal muitiresolution 
decomposition of the current GOF into 2 n low and high frequency temporal subbands, said step 
itself comprising : 

- a motion estimation sub-step ; 

- based on said motion estimation, a motion compensated temporal filtering sub- 
step, performed on each of the 2 0 " 1 couples of frames of the current GOF ; 

- a spatial analysis sub-step, performed on the subbands resulting from said 
temporal filtering sub-step ; 

b) an encoding step, said step Itself comprising : 

- an entropy coding sub-step, performed on said low and high frequency temporal 
subbands resulting from the spatio-temporal analysis step and on motion vectors obtained by 
means of said motion estimation step ; 

- an arithmetic coding sub-step, applied to the coded sequence thus obtained and 
delivering an embedded coded bitstream ; 

said coding method being further characterized in that, in the encoding step, the 2 n frequency 
subbands available at the end of the analysis step for each GOF are coded in an order that 
corresponds to a progressive reconstruction of the couples of frames of said GOF in their 
original order, the bits necessary to later decode the first couple of frames being at the 
beginning of the coded bitstream, followed by the extra bits necessary to decode the second 
couple of frames, and so on up to the last couple of frames of the current GOF. 
2. A coding method according to claim 1, characterized in that, n being equal to (0, 1, 

2, 3), among the set of subbands available for the current GOF at the end of said analysis step 
and comprising the high frequency temporal subbands (HO, HI, H2, H3) of the first 
decomposition level, the high frequency temporal subbands (LHO, LH1) of the second 
decomposition level and the low and high frequency temporal subbands (LLLO, LLHO) of the 
third decomposition level, the subbands (LLLO, LLHO, LHO, HO) are first coded, then the 
subband HI, then the subbands (LH1, H2), and then the subband H3. 

3. A transmittable video signal consisting of a coded bitstream generated by a three- 
dimensional (3D) video coding method for the compression of a bitstream corresponding to an 
original video sequence that has been divided into successive groups of frames (GOFs) the size 
of which is N = 2 n with n = 0, 1, 2,..., said coding method comprising the following steps, 
applied to each successive GOF of the sequence : 
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a) a spatio-temporal analysis step, leading to a spatio-temporal multiresolutJon 
decomposition of the current 60F into 2" low and high frequency temporal subbands, said step 

itself comprising : 

- a motion estimation sub-step ; 

- based on said motion estimation, a motion compensated temporal filtering sub- 
step, performed on each of the 2* 1 couples of frames of the current GOF ; 

- a spatial analysis sub-step, performed on the subbands resulting from said 

temporal filtering sub-step ; 

b) an encoding step, said step itself comprising : 

- an entropy coding sub-step, performed on said low and high frequency temporal 
subbands resulting from the spatio-temporal analysis step and on motion vectors obtained by 
means of said motion estimation step ; 

- an arithmetic coding sub-step, applied to the coded sequence thus obtained and 

delivering an embedded coded bitstream ; 

said coding method being such that, In the encoding step, the 2" frequency 
subbands available at the end of the analysis step for each GOF are coded in an 
order that corresponds to a progressive reconstruction of the couples of frames of 
said GOF in their original order, the bits necessary to later decode the first couple 
of frames being at the beginning of the coded bitstream, followed by the extra 
bits necessary to decode the second couple of frames, and so on up to the last 
couple of frames of the current GOF. 
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Abstract 

The invention relates to a three-dimensional (3D) video coding method for the 
compression of a coded bitstream corresponding to an original video sequence that has been 
divided into successive groups of frames (GOFs) the size of which is N ~ 2 n with n - 0, 1, 2,... 
This coding method comprises the following steps, applied to each successive GOF : 

a) a spatio-temporal analysis step, itself comprising a motion estimation sub-step, a 
motion compensated temporal filtering sub-step and a spatial analysis sub-step ; 

b) an encoding step, itself comprising an entropy coding sub-step, performed on the low 
and high frequency temporal subbands resulting from the spatio-temporal analysis step and on 
motion vectors obtained by means of said motion estimation step, and an arithmetic coding 
sub-step, applied to the coded sequence thus obtained. 

According to the invention, the 2 n frequency subbands available at 
the end of the analysis step are coded in an order that corresponds to a 
reconstruction of the couples of frames in their original order, the bits necessary 
to decode the first couple being at the beginning of the coded bitstream, 
followed by the extra bits necessary to decode the second couple, and so on up 
to the last couple. 
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Initialization 

For all temporal subbands &, for all couples C 
S_IsLinkcdWith_C°FALSE 



(MQTemporal filtering TFofthe couples 
TF(Co) = (L,^ H u0 ) 
TF(C,) = (L w ,Ht.i) 



83 



85" 



Updating 

. Lu, IsIinkedWiflu ^TRUE 
H,^IsLmkedWitb_ Qr^TRUE 
• L, UkLinkedWith_. C,=TRUE 
H,',JsLiiikedWith_ C,-TRUE 
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Formation of new couples K with the L subbands 
K©==( L^, Lju) 
• Ki«=( I^, I^) 



33 



I 



(MC)TemporaI filtering of the co 



Updating 

For all subbands S of the current temporal level jt 
For all subbands P in the parent couple K. 
* For all original couples C 
S_IsIinkedWitb C 
- S_IsUnkedWith_ C j PJfcUnkedWitfa_ C 




Extraction of the following subbands 
(the transmitted subbands T) 

*^jt=l,-jt_max, n==0.-N/(2 il ) 



Spatial decomposition 
of the subbands in T 
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